Two Dimensional Yau-Hausdorff Distance with Applications on Comparison of DNA and Protein Sequences

نویسندگان

  • Kun Tian
  • Xiaoqian Yang
  • Qin Kong
  • Changchuan Yin
  • Rong L. He
  • Stephen S.-T. Yau
  • Yang Zhang
چکیده

Comparing DNA or protein sequences plays an important role in the functional analysis of genomes. Despite many methods available for sequences comparison, few methods retain the information content of sequences. We propose a new approach, the Yau-Hausdorff method, which considers all translations and rotations when seeking the best match of graphical curves of DNA or protein sequences. The complexity of this method is lower than that of any other two dimensional minimum Hausdorff algorithm. The Yau-Hausdorff method can be used for measuring the similarity of DNA sequences based on two important tools: the Yau-Hausdorff distance and graphical representation of DNA sequences. The graphical representations of DNA sequences conserve all sequence information and the Yau-Hausdorff distance is mathematically proved as a true metric. Therefore, the proposed distance can preciously measure the similarity of DNA sequences. The phylogenetic analyses of DNA sequences by the Yau-Hausdorff distance show the accuracy and stability of our approach in similarity comparison of DNA or protein sequences. This study demonstrates that Yau-Hausdorff distance is a natural metric for DNA and protein sequences with high level of stability. The approach can be also applied to similarity analysis of protein sequences by graphic representations, as well as general two dimensional shape matching.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

APPLICATION OF TWO-DIMENSIONAL ELECTROPHORESIS AND NIH 3T3 CELL TRANSFECTION ASSAY IN THE STUDY OF TUMOR-AS SOCIATED PROTEINS AND GENOMIC DNA TUMOROGENICITY IN MALIGNANT HUMAN ESOPHAGEAL SPECIMENS

Total protein and DNA extracted from histologically diagnosed normal nonmalignant and esophageal tumor tissues were used for analysis of polypeptides pattern by two-dimensional gel electrophoresis and DNA transforming activity in NIH 3T3 cell transfection assay, respectively. In comparison to normal tissues, eight polypeptides underwent down-regulation or disappeared, while seven polypeptid...

متن کامل

An Evolutionary and Phylogenetic Study of the BMP15 Gene

DNA sequence data contains a wealth of biologically useful information. Recent innovations in DNA sequencing technology have greatly increased our capacity to determine massive amounts of nucleotide sequences. These sequences can be used to specify the characteristics of different regions, interpret the evolutionary relationships between categorized groups, likelihood of performing multiple com...

متن کامل

Comparison of Three-Dimensional Double-Echo Steady-State Sequence with Routine Two-Dimensional Sequence in the Depiction of Knee Cartilage

Introduction: There are some routine two-dimensional sequences, including short tau inversion recovery (STIR), T2-weighted fast-spin echo (T2W-FSE), and proton-density fast spin-echo for diagnosing osteoarthritis and lesions of the knee cartilage. However, these sequences have some disadvantages, such as long scan time, inadequate spatial resolution, and suboptimal tis...

متن کامل

Study on Genetic Diversity of Terminal Fragment Sequence of Isolated Persian Tobacco Mosaic Virus

Tobacco mosaic virus (TMV) is one of the devastating plant viruses in the world that infects more than 200 plant species. Movement protein plays a supportive role in the movement of other plant viruses, and viral coat protein is highly expressed in infected plants and affects replication and movements of TMV. In order to investigate genetic variation in the terminal fragment sequence in Iranian...

متن کامل

Identification of Novel Mutations in IL-2 Gene in Khorasan Native Fowls

The intron-exon structure of Khorasan native fowl interleukin-2 (IL-2) was investigated. For this purpose, twenty chickens were selected from the Native Fowl Breeding Station of Khorasan province, and genomic DNA was extracted using a modified conventional DNA extraction protocol. An 875 bp fragment of IL-2 was successfully amplified, including a small part of the promoter, exon 1, intron 1, an...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره 10  شماره 

صفحات  -

تاریخ انتشار 2015